Inference: A Research Review



As interest in the topic of the unobservable part of the reading process, comprehension, has grown, several new areas of reading research have emerged. The present paper will examine one of these factors, inference, which was an almost unheard-of topic before 1970. Calfee and Drum (1979) note that

Comprehension is a complicated matter; it can be virtually synonymous with thinking. Trying to analyze the process of comprehension is an interesting challenge. We propose here two broad categories of comprehension tasks transliteral and inferential.

While many authors agree that there is such a skill as drawing inferences, agreement on a definition is more difficult. A basic definition, representing many writers who do not deal with hierarchies of inference, comes from Strange (1980).

A question/answer relationship is textually implicit if both question and answer are derived from the text, but there is no expressed grammatical cue tying the question to the answer (p. 395).

Frederiksen (1979) has developed much more specific and elaborate definitions about inference and types of inferences. His broad definition, however, adds specificity to Strange's concept.

The fifth and highest level of processing operates entirely on propositions, generating new propositions that are given, for example, from prior discourse, from discourse context, or from previously acquired knowledge about the world. Any such processes will be referred to here as inferences.

In elaborating on the nature of inferences, Trabasso (1980) has defined two ways in which inferences are made. The first involves text connecting: finding relationships between events in the text. The second is slot-filling: adding missing information to enable text connecting. Trabasso reiterates the difficulty of studying inference reported by many researchers: "...the making of an inference is a highly automatic and largely unconscious process... (but) is not a simple or obvious process" (p. 3). In spite of difficulties with observation, measurement and even definition, inference has been the topic of much research in the past ten years. That research will be analyzed in this paper.

In 1917, Edward L. Thorndike conducted a landmark study in reading comprehension. The study was significant because it looked at the complexity of comprehension for the first time; up to that point reading had been considered to be "...a rather simple compounding of habits" (Thorndike, 1917, p. 425). Thorndike gave two hundred sixth-grade pupils a series of simple paragraphs and asked them to write the answers to questions about them. The questions included examples of main idea, factual recall and inference. Thorndike was impressed with the extensive variety of answers to his questions, and found that the responses did not fit neatly into categories of errors. Rather, he drew conclusions about the nature of "correct reading" based on the analysis of responses (Thorndike, 1917):

In correct reading (1) each word produces a correct meaning, (2) each such element of meaning is given a correct weight in comparison with the others, and (3) the resulting ideas are examined and validated to make sure that they satisfy the mental set or adjustment or purpose for whose sake the reading was done (p. 428).

He went on to describe certain errors as failures to perform one of these operations. Thorndike concluded that correct reading was very closely tied to the reasoning process; thus, his title in reporting the research is "Reading as Reasoning: A Study of Mistakes in Paragraph Reading."

Before leaving the Thorndike study, it is interesting to note that the three factors Thorndike describes might be viewed by present theorists as representative of the interactive nature of reading. Thorndike found that each word must produce a correct meaning (text factor), which the reader must weigh correctly in relation to other words (reader factor), and then validate according to the purpose for reading (process factor).

The reasoning factor identified by Thorndike was supported in several studies designed to analyze factors of comprehension. Feder (1938) factor analyzed the responses of 700 college students on "comprehension maturity" tests. He identified two distinct levels of response: reading for information and reading for inference. Comprehension factors were examined by F. B. Davis in two separate studies. The first study (Davis, 1944), conducted in 1941 with college freshmen, concluded that the two most important factors in reading were memory for word meanings and reasoning in reading (combining the categories of weaving ideas and drawing inferences). The second study was a much more carefully controlled and planned inspection of the reading process. In it, Davis (1968) administered a series of nine tests to 1,100 twelfth-grade students. He analyzed eight possible factors of comprehension to determine what percent of the unique variance each contributed. His categories and findings are summarized in Table 1. The percentages are averages from within-day and across-day matrices.

Table 1

DAVIS CATEGORIES AND AVERAGE PERCENTAGES OF COMPREHENSION FACTORS

Categories                                       Percent Unique Variance

1. recalling word meanings                       32
2. drawing inferences about word meanings        3.5
3. factual recall                                10
4. weaving together ideas                        5
5. drawing inferences from context               20.5
6. recognizing author's purpose, tone, etc.      11
7. identifying author's technique                5.5
8. following structure of passage                13.5

Davis again interpreted his data to support word meanings and drawing inferences as the two factors accounting for over half of the unique variance.
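
The "over half" figure can be checked against the Table 1 values. The Python sketch below is only an arithmetic check on the percentages as printed; the dictionary and variable names are illustrative and do not come from Davis.

```python
# Average percent of unique variance per category, as printed in Table 1.
# The dictionary name and key wording are illustrative labels only.
davis_1968 = {
    "recalling word meanings": 32.0,
    "drawing inferences about word meanings": 3.5,
    "factual recall": 10.0,
    "weaving together ideas": 5.0,
    "drawing inferences from context": 20.5,
    "recognizing author's purpose, tone, etc.": 11.0,
    "identifying author's technique": 5.5,
    "following structure of passage": 13.5,
}

# Share carried by the two factors Davis emphasized.
top_two = (davis_1968["recalling word meanings"]
           + davis_1968["drawing inferences from context"])

# Sum over all eight categories (rounded averages need not total exactly 100).
total = sum(davis_1968.values())

print(f"word meanings + inferences from context: {top_two}")
print(f"all eight categories: {total}")
```

The two factors together come to 52.5 percent, and the eight averaged figures total 101 rather than 100, presumably because of rounding and averaging across matrices.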

Spearritt (1972) used Davis' 1968 study and applied the statistical analysis of maximum likelihood. He found support for the word knowledge factor, but felt that three other factors, drawing inferences from context, recognizing a writer's purpose, and following the structure of a passage, were so closely related that they should be considered as a single factor: reasoning.

Although these studies used categories identified as inference ability, the first study to focus exclusively on inference was conducted in 1930 by a professor at Ohio State University (Tyler, 1930). He was concerned with the objective measurement of all educational goals. His department had adopted the goal that each student should have "An ability to draw inferences from facts, that is, to propose hypotheses." To measure students' ability to infer, Tyler tested sixty-six students in a beginning zoology course. The students took two tests in the same day, one requiring production of an inference, the second requiring recognition of an inference. Students read a series of factual statements; they were instructed to "Write... the one inference which seems to you most reasonable in attempting to interpret the facts" (Tyler, 1930, p. 476). A sample paragraph and logical inference were provided. Papers were collected, and the students were given the same items in multiple-choice form. One might expect a rather large carryover from the inference generated to the multiple-choice items. However, Tyler found a correlation of only .38. His interpretation of this data was that "the ability to select the most reasonable inference from a given list is not the same as the ability to propose an original inference" (pp. 477-8). Tyler did not consider the possibility that inferences drawn in the first measure might have interfered with the ability of students to select a correct inference.

Tyler dropped the multiple-choice format in favor of the generated inference, and developed a procedure for evaluating such inferences. He sorted inferences into five "equal-appearing intervals," which he had each member of the education faculty use for sorting samples of student inferences. He found three instructors to be as reliable as the whole department in determining "the quality of the original inferences... proposed by the student" (p. 479). He further validated the procedure with new faculty members, finding the correlation of their judgment with that of existing members to be .96. In another analysis, Tyler found that his method of testing inference had a very low correlation (r = .29) with a test of factual knowledge over the same material. He concluded that inference was not only a separate skill, but that it could be measured and "the merit of an inference is not the purely subjective quality often claimed" (p. 479).

Inference did not appear again in the literature as a separate concern until the seventies. The only mention of inferential ability was in studies such as Davis' (1944), where it was considered as part of the hierarchy of reading skills. Even these studies dealt with mature readers, rather than with children learning the basics of the reading process. As researchers began to look at young readers, they first focused on more observable behaviors such as word recognition for lists of words, nonsense words, syllables and even letters. Rate was easily measured, as were eye movements and other physical characteristics of the reader. It was not until the behavioral wave of research subsided that the complexities of comprehension were examined. Reading research was given added impetus from such fields as psycholinguistics, artificial language and psychology. The seventies produced intense discussion and theory about the nature of the reading process; the theories are presently being tested through research. The studies evolving from this atmosphere pertinent to inference are reported here.

Several interesting studies of children's inferential skills were conducted by Paris (Paris & Upton, 1974; Paris, 1975; Paris & Lindauer, 1976). Tierney and Cunningham (1980) summarize the first study by noting that "Paris (1975) found young children are less able to relate their own background experiences in the process of inferencing" (p. 46). This conclusion contrasts with the opinion of those who feel that young children do not have sufficient background experiences to make the needed connection. Paris pursued this factor in his next study (Paris & Upton, 1974). Paragraphs were read to twelve subjects at each of six grade levels, kindergarten through fifth grade. Pupils were asked to respond to four literal and four inferential questions. The most interesting conclusion, apart from anticipated main effects for grade and question type (p < .01), was that younger children seemed to be capable of drawing inferences. What separated them in ability from older children was that they did not do so spontaneously.

In order to explore this conclusion, Paris and Lindauer (1976) conducted an experiment using subjects from first, third and fifth grades. A total of forty-eight students read sentences where an instrument was either explicit (The man dug a hole with a shovel) or implicit (The man dug a hole). They were then tested on the four implicit and four explicit sentences by being given the cue (which had been stated in only half of the sentences) and asked to recall the sentence. Analysis showed a significant difference between grades one and three, and one and five, but not between three and five, indicating that the development of the skill occurred in the primary grades. By fifth grade, students performed equally well on explicit and implicit instrument sentences. Paris and Lindauer performed a second experiment using sixteen each of kindergarten, second and fourth graders. In this design, all sentences implied an instrument, but did not state it. Questions were asked using as recall cues explicit information (the stated subject, verb or object of the sentence) or implicit (the instrument). Again, younger children did not do as well on implicit-cued sentences as on explicit ones.

The third experiment reported by Paris and Lindauer attempted to remedy what they hypothesized as a lack of spontaneous production rather than an inability to deal with inferential material. The researchers had ten first graders act out the implicit sentences. The children were instructed to perform the actions for "The man dug a hole." The results were dramatic. Overall recall was significantly improved. More importantly, the difference between explicitly cued and implicitly cued sentences was eradicated, with subjects recalling 72% of the explicit-cue sentences and 70% of the implicit-cue sentences. Paris and Lindauer concluded that young children were capable of drawing inferences, but that they had no spontaneous strategy. This observation has been supported in other areas of learning, such as memory for lists (Smirnov & Zinchenko, 1969). It is unfortunate that Paris and Lindauer did not use a large enough sample to report their results in terms of significance, and that no one has sought to replicate this study using controls. The implications for teaching inference skills seem promising.

A number of studies have dealt with variations of the developmental process identified by Paris. Hildyard (1976) had sixteen first, third and fifth graders listen to passages, and sixteen third and fifth graders read passages silently. Measures were taken of implications (defined as explicit inferences requiring no world knowledge) and of inferences, both those related to world knowledge and those contradictory to world knowledge. A "marked developmental difference" was found for implications and for those inferences that contradicted previous world knowledge. Reading facilitated implication; listening facilitated inference related to world knowledge.

Mandler and Johnson (1977), in their classic study of story structure and its effect on recall, examined the types of inferences children made in terms of their appropriateness. Their subjects were first and fourth grade students, and adults. The data in their Table 8, "Proportion of Subjects Producing Additions During Recall, Total Number of Additions, and Proportion Occurring in Each Classification," are of particular interest. All subjects were approximately equal in the proportion of "reasonable elaborations" (1st - .36; 4th - [illegible]; adult - .35). There were developmental trends in two other categories, however. Emphatic or redundant additions which were consistent with the original text increased with age (1st - .15; 4th - .41; adult - .49). Irrelevant or structural "fillers" which were not text-based decreased with age (1st - .40; 4th - .29; adult - .16). Mandler and Johnson interpreted these data in terms of young children's need to fill in missing story structure; it seems equally important to note that the skill of making appropriate inferences undergoes developmental changes in the years between first and fourth grades.

In an unpublished study done in 1977, Stein (1978) examined first and fifth graders' interpretations and recall of stories with very discrepant information (e.g., a mean, selfish fox helps a weak bear catch and cook a fish). When asked for exact recall, first graders recalled only the story line, not the discrepant description of the fox. Fifth graders recalled both the story and the description; some even made some inferences about the fox's motives. When asked to "fill in anything that is missing," first graders again gave fairly verbatim recall, with no mention of incongruities. Fifth graders made inferences as to the reasons for the incongruous action, and sought to provide resolution to the story. While her report contained no statistical data, these findings have been replicated in other studies (Stein & Glenn, 197?).

In one such study, Nezworski, Stein, and Trabasso (1979) looked for differences between kindergarten and third-grade students, testing a total of 144 children. Their material was written in two forms: the control passage presented a negative protagonist, while the added-information passage gave an explanation of why the protagonist acted as reported, making that person less negative. Results of the questioning for each type of passage are reported in Table 2.

Table 2

PROPORTION OF CORRECT INFERENCES MADE TO PROBE QUESTIONS

Inference type      Control               Added Information
                    K        3            K        3

causal              .36      .38          .64      .87(*)
consequence         .33      .33          .82      .96(*)
reaction            .89      .97(*)       .88      .97(*)

(*) significant at the .05 level

It appears that the low proportion of correct responses in the causal and consequence categories on the control passage indicates a uniform difficulty in dealing with ambiguity for kindergartners and third graders. However, the older students were better able to make use of added information in resolving ambiguity.
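
The size of the benefit from added information can be read off Table 2 by subtracting the control proportion from the added-information proportion for each grade. The Python sketch below does only that subtraction on the values as printed; the variable names are illustrative, and the raw differences say nothing about statistical significance.

```python
# Proportions of correct inferences as printed in Table 2.
# Tuple order: (kindergarten, third grade). Names are illustrative.
control = {"causal": (0.36, 0.38), "consequence": (0.33, 0.33), "reaction": (0.89, 0.97)}
added = {"causal": (0.64, 0.87), "consequence": (0.82, 0.96), "reaction": (0.88, 0.97)}

# Gain from the added-information passage, per inference type and grade.
gains = {
    kind: tuple(round(a - c, 2) for a, c in zip(added[kind], control[kind]))
    for kind in control
}

for kind, (k_gain, g3_gain) in gains.items():
    print(f"{kind}: K {k_gain:+.2f}, grade 3 {g3_gain:+.2f}")
```

On causal and consequence questions the third graders gain more than the kindergartners (.49 against .28, and .63 against .49), while reaction questions are near ceiling in both conditions.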

A developmental difference at an even younger level was reported by Poulsen, Kintsch, Kintsch, and Premack (1979). They found that four and six year old children can both make up stories from pictures presented in logical order. However, when the pictures were presented in random order, the younger children reverted to simply labeling each picture. The older children tried to make sense of the pictures as presented, and made inferences about thoughts and emotions in order to tie a story line together. This finding is consistent with Stein's 1977 study in its developmental difference, but a perplexing discrepancy exists. The "older" students in the Poulsen et al. study were six; the younger children in Stein's study were first graders, probably six or seven years old. The age group that Stein reported as unable to resolve incongruity showed itself to be quite able to do so in the picture-story telling task presented by Poulsen et al. One possible explanation is that in the Stein study, the story was read to subjects; therefore, they did not have the text for reference. Subjects in the Poulsen et al. investigation had a concrete representation of the incongruity, and could ponder and process with more ease. Also, the required task would tend to facilitate inference in the story-telling condition. Telling a story, for a child who understands the construct of a story (which both four and six year old children in the study did), requires the logical connection of ideas and events. In the case of the incongruous pictures (those in scrambled order), inferences were required to create a cohesive story. In contrast, the Stein task was only loosely focused on resolving ambiguity, with instructions to fill in missing information. It is quite possible that with specific inference probes, these younger children could have produced logical inferences. The work of Paris and Lindauer (1976) supports this hypothesis.

Another major category of inferential research, besides its developmental nature, has to do with the nature of inferences made. Much of the current examination of inference is based on schema theory proposed by reading theorists (Anderson, 1977; Rumelhart & Ortony, 1977; Spiro, 1977). Various systems for analyzing inferences have been devised, such as Collins, Brown and Larkin's (1977) Progressive Refinement Model, which "entails examining inference holistically, within the framework of the reader's construction, evaluation, and perhaps revision of a scenario for the text" (Tierney, Vaughn, & Bridge, 1979). Frederiksen's Taxonomy of Text-Based Inferences has been used in several research designs as a system for categorizing types of inferences (Frederiksen, 1977; Frederiksen, Frederiksen, Humphrey, & Ottesen, 1978). An excellent, highly usable taxonomy was proposed by Warren, Nicholas and Trabasso (1979). The following group of studies adds to these taxonomies empirical data on the nature of inference.

Goetz (1977) conducted a study of 184 subjects divided into a ninth/tenth grade group and an eleventh/twelfth grade group. Groups were further separated into high and low verbal ability sections. The material read was of two types: an important version and an unimportant version. One story, for example, presented the reader with the information that a woman was late to the airport because she stopped to talk to her son. In the unimportant version, the narrative describes the outcome of this diversion to be less time spent in the airport. In the important version, the talk causes her to miss her flight, which crashes; thus the time with her son saved her life. Other text variables included salience and explicitness, described by Goetz in the following passage:

Salience of the premise was varied by changing the amount of detail or emphasis given to the material that cued the specific target inference. Explicitness of the target was varied in order to provide a control condition in which the target had been explicitly stated (p. [illegible]).

There were six sets of eight passages with various combinations of importance, salience and explicitness, each from 500 to 520 words in length. The experimenter carefully matched these fiction passages, and arranged them to avoid an order effect.

Subjects were asked five questions about each passage. The first was always an inference question, the last an "importance" question. Interspersed were basic premise questions and questions used to control for general memory. Subjects also rated each question as 1) an exact quote, 2) paraphrase, 3) directly implied, or 4) consistent with story. An interesting design strategy had the students perform a word recognition test after reading the passage and before answering the questions; this test served as a buffer for short-term memory as well as a control for readability of the passages (albeit a limited control).

Main effects were reported for grade level and verbal ability. There was no main effect for importance; however, there was an interaction between importance and explicitly stated items. Goetz reports this interaction:

Simple effects analysis of this interaction revealed that the important passages produced significantly higher scores than the unimportant passages for the implied condition (Important = .81, Unimportant = .74) p<.01. Thus the importance of the target inference had a significant effect on the probability that the inference would be made.

Salience was also involved in an interaction; low verbal ability students made significantly more inferences on highly salient passages. Students seemed aware that they were making inferences, since they rated significantly more explicitly stated propositions as having been stated. In summary, it appears that inferences are more likely to be made when it is important that they be made.

A study designed to examine when and how inferences are made was conducted by Spiro and Esposito (1977). They required twenty educational psychology majors to read 250 word vignettes and answer questions. In the passages, A "pragmatically implied" B, and variable C lessened the extent of that implied relationship. One group of students read the passages with C coming after the A-B relationship; the other group read C before A-B. The control group had no C statement. Seven questions were asked for each passage, with the following distribution:

1) asked if B had been mentioned

2) asked a yes/no question of B

3)-6) other questions about the passage, designed to mask the intent of the study (not analyzed)

7) C information

Subjects further rated the certainty of their responses on a six point scale.

There were significantly (p < .015) more errors when C was read after A-B than when C occurred before. In the C-after condition, subjects often reported that the inference was not in the text (B-mention errors) or that the opposite was true (B-incorrect errors). Spiro and Esposito concluded that "predictable information is superficially processed and not stably represented in memory." While it appears likely from this experiment that readers are more likely to make errors after the inference has been made, then contradicted, design limitations lessen the generalizability of the findings. The authors report sample sizes of only four students in each of the experimental groups, and eight in the control group. While they report that group differences accounted for 23.7% of the reported variance, they did not specify whether this analysis was of the experimental versus control, between the two experimentals, or among all three. Four remains an almost inconsequential sample size. Perhaps an even more damaging problem surfaces in examining the instructions given the students and the time allowed for reading. Subjects were instructed to read and understand each passage, and asked not to look back. However, they were given nine minutes for each 250 word passage. Some must have filled the eight minutes with rereading, if only out of boredom. Others might have processed the incongruity of the C-after condition, and reread for resolution.

Schreiner and Shannon (1980) examined "The Recall of Explicit and Implied Propositions by Good and Poor Readers Using Three Types of Assessment Procedures." They defined good readers as eighth grade students reading at a mean grade equivalent of 10.0, based on the California Achievement Test; poor readers were from the same grade reading at a mean grade equivalent of 5.0. The subjects were randomly assigned to code groups FR (free recall), AR (aided recall) or MC (multiple choice). Three "natural expository passages" were read; they were reported as having readabilities of 3.0, 4.0 and 5.0 on the Fry graph (although the Fry does not measure to tenths).

Main effects were found for ability, explicitness and question type. Good readers performed better than poor readers, with means of 2.85 and 1.45 respectively. Explicit propositions were recalled a mean of 2.47 times; the mean for implicit items was 1.59. Means for multiple choice, aided recall and free recall were 2.32, 1.40 and .80, respectively. The authors reported surprise that even the high group recalled only 52% of the total propositions; however, this finding is consistent with other research (Tierney, Bridge, & Cera, 1978-79).

A major limitation of the study is the classification of good and poor readers, and subsequent testing based on that classification. It was not reported whether the scores from the CAT were the overall reading score or the vocabulary or comprehension section. If comprehension was a large part of the determinant score, then the poor performance on comprehension questions was to be expected. However, a matter for greater concern is the readability level of the passages used. All passages were at an independent level for the good readers. The poor readers, on the other hand, may have been reading at instructional and even frustration levels, since their average reading ability was 5.0. This failure to match reading ability with passage difficulty is a crucial limitation.

The role of prior knowledge in a reader's ability to make inferences was the focus of a study conducted by Pearson, Hansen, and Gordon (1979). Their population was "slightly above average" second graders, differentiated by strong and weak schemata for knowledge about spiders. Recall was measured by "wh" questions, which were judged to be more ecologically valid than free recall. The two groups did not differ significantly on average reading ability scores, as measured by the Metropolitan Achievement Test, or on IQ (weak schema mean IQ = 114.8, strong group = 120.4). The subjects were placed in schema groups based on their scores on an eight-question test of knowledge about spiders. Students answering five, six, or seven (mean = 5.8) were designated as having strong schemata. Students answering four questions were dropped from the study; the remaining groups were significantly different at the .001 level.

To these well-defined groups, Pearson et al. presented a basal selection (Lyons and Carnahan, 1972) rewritten to include extra information about spiders, and a narrative line. The story had a readability level of 2.8 on the Spache Readability Formula. Students read the passage once, silently, and could not refer to it during questioning. Examiners did not help with unknown words. Explicit and inferential knowledge were measured during questioning.

Results indicated a simple effect for prior knowledge on inferrable questions (p < .025). There was not a significant effect for prior knowledge on explicitly stated questions. Main effects for strength of prior knowledge and question type, both significant at the .01 level, are reported in Table 3.

Table 3

EFFECTS OF PRIOR KNOWLEDGE AND QUESTION TYPE

                  explicit questions    implicit questions    total

strong schema     4.7                   2.8                   7.5
weak schema       3.8                   1.0                   4.8
average           4.25                  1.9

In their discussion of the results, Pearson et al. note that the findings are not consistent with the prediction that prior knowledge would affect both inferential and literal recall. However, it seems that the lack of significant differences on the explicit material is actually a validation of the reading ability/intelligence control so carefully built into the design. The study showed that both groups learned stated facts equally well. The more difficult process of combining facts with prior knowledge (inferencing) did prove to be dependent on level of prior knowledge.
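
The totals and averages in Table 3 follow directly from its four cell values, and the same values give the size of the gap between schema groups on each question type. The Python sketch below recomputes them from the cells as printed; the variable names are illustrative, and the raw gaps are descriptive only.

```python
# Cell values as printed in Table 3; keys and variable names are illustrative.
table3 = {
    "strong schema": {"explicit": 4.7, "implicit": 2.8},
    "weak schema": {"explicit": 3.8, "implicit": 1.0},
}

# Row totals: explicit + implicit for each schema group.
totals = {group: round(s["explicit"] + s["implicit"], 2)
          for group, s in table3.items()}

# Column averages across the two groups.
averages = {
    qtype: round(sum(s[qtype] for s in table3.values()) / len(table3), 2)
    for qtype in ("explicit", "implicit")
}

# Gap between groups on each question type.
gaps = {
    qtype: round(table3["strong schema"][qtype] - table3["weak schema"][qtype], 2)
    for qtype in ("explicit", "implicit")
}

print(totals, averages, gaps)
```

The recomputed totals (7.5 and 4.8) and averages (4.25 and 1.9) match the table. The gap between groups is 0.9 on explicit questions and 1.8 on implicit questions, which is the direction of the reported simple effect for inferrable questions.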

One of the most widely referred to studies of the nature of inference was performed by Connie Bridge as dissertation research, and reported by Tierney, Bridge, and Cera (1978-79). The study was concerned with the extent and type of inferential processing of written, non-narrative discourse. Subjects were thirty-six third grade readers, divided into groups of good and poor readers on the basis of Stanford Achievement Test scores and teacher judgment. Children read orally a passage about dinosaurs, read a buffer passage to control for short-term memory, and responded to free and probed recall. Probe questions were based on free recall responses, and avoided leading readers to insights they had not generated. Frederiksen's system of text analysis for Text-Based Inferences was used to categorize responses; his eight classes of inferential operations were the basis for comparison of types of inference between the two groups.

Major findings included the well-documented point that good readers
answer significantly more questions than poor readers, whether the questions
are analyzed as a whole or for explicit and inferential types separately.
Good readers tend to do everything in the reading task better than poor
readers; that is why their SAT scores and teachers' opinions single them
out as good readers. There was no significant difference between good and
poor readers in the proportion of inferred to explicit recalls. (There
is a discrepancy in the reporting of this result. In the abstract of
Bridge's dissertation, it was stated that approximately 45% of the
recalls were inferences. In the Tierney et al. report, the proportion of
inferred to explicit recalls was statistically reported as .51 for all
readers.) The third major finding was that during free recall, both
groups recalled about twice as many explicit propositions as inferred.
In the probed condition, more inferences than literal recalls were
recorded.
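
The reporting discrepancy noted above can be checked with simple
arithmetic. The sketch below is illustrative only: it assumes that the
.51 figure is a ratio of inferred to explicit recalls and that the 45%
figure is a share of all recalls, which is one possible reading of the two
reports and is not confirmed by either source. The function names are
hypothetical.

```python
def share_from_ratio(ratio):
    """Convert an inferred:explicit ratio into the inferred share of all recalls."""
    return ratio / (1.0 + ratio)


def ratio_from_share(share):
    """Convert an inferred share of all recalls into an inferred:explicit ratio."""
    return share / (1.0 - share)


# Figures as quoted in the text; their interpretation here is an assumption.
reported_ratio = 0.51  # Tierney et al.: proportion of inferred to explicit recalls
reported_share = 0.45  # Bridge abstract: roughly 45% of recalls were inferences

print(round(share_from_ratio(reported_ratio), 3))  # 0.338
print(round(ratio_from_share(reported_share), 3))  # 0.818
```

Under these assumptions a ratio of .51 corresponds to about one third of
all recalls being inferred, which matches the free-recall finding of
roughly twice as many explicit as inferred propositions but not the 45%
figure. If .51 is instead a share of all recalls, the two reports still
differ (51% against 45%).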

Possibly the most important contribution this study makes is about
the nature of inferences made by good and poor readers. The discussion
section explains:

...good readers generated more causal and conditional connections
between propositions, while poor readers tended to substitute more
general concepts for the specific terms used in the passage. The
good readers' use of causal and conditional inferences seemed to
add to the organization and cohesiveness of their recalls. The
poor readers' use of superordinated concepts seems to result in
a loss of accuracy and specificity (p. 564).

This finding indicates the need for more specific criteria for
evaluating inferences. The Warren et al. system is more accurate in that
it assigns inferences on the basis of their correctness, as well as
on their basis in the text. Warren et al. (1979) point out the necessity
of applying a "relevancy criterion" to certain categories.

There are several problems affecting the generalizability of the
study. There was no matching of groups, nor reporting of group means
for such relevant data as intelligence or verbal ability. The standardized
test was not reported; groups were assigned according to percentiles (good
= above sixty-eighth percentile, poor = below thirty-eighth percentile;
these are not equivalent). It was reported that the experimenters dropped
those who scored below d4t word recognition on the experimental material. 
That is a terribly low word recognition criterion; it is most likely that
many of the poor readers were reading at a frustration level. Did their
necessary attention to word recognition and analysis interfere significantly
with their ability to comprehend? Did the fact that the passage was read
orally interfere with the comprehension of either group?

In discussing the findings, the authors hypothesized that "maybe with
probing the good readers were willing to extend their recalls beyond the
explicit." It also seems likely that students, conditioned by school,
were tuned into the "right answer syndrome," which emphasizes the recall
of literal information (Durkin, 1978-79). There may be another factor
influencing the reversed proportion of explicit/inferred recalls in the
probed condition. It is likely that the questioners asked questions
posed to elicit inferences, since that was the focus of the study. It
is also logical to assume that the good readers, especially, had already
recalled most of the explicit information they were capable of remembering;
inference was the next level of processing (Taba, 1965).

In 1979, Tierney and Bridge performed further analysis on Bridge's
data. They examined Frederiksen's Text-Based Inference categories, and
found no statistical differences across categories. They then reexamined
the responses of nine poor and nine good readers according to five
additional categories. These included three of Frederiksen's Functional
Contexts of Inference: generalization, deletion, and integration. Also
included were van Dijk's macro-rules for extension and connection. No
significant differences were found for generalization, extension, or
connection. Results were significant for the finding that poor readers
deleted more propositions than good readers (p < .05), and that good
readers integrated more propositions than poor readers (p < .01).
Tierney and Bridge conclude that "the ability to integrate propositions
is essentially what differentiates 'control' of inferred processes by
good and poor readers" (p. 32). While this seems to be an
oversimplification, even an overgeneralization from the data, it is
certainly a variable worth investigating. Further research should
differentiate between correct and incorrect (or appropriate and
inappropriate) inference categories to simplify interpretation of data.

A final category of investigation concerning inference is the more
pragmatic research which asks how inferential ability can be improved.
This research will be presented briefly, since it is peripherally related
but not the central focus of this paper. Nardelli (1957) failed to find
significant gains in sixth-grade pupils who participated in a short unit
of instruction on drawing inferences. Wong (1^) found that
learning-disabled students were better able to draw inferences when prompted
recalls were used; however, no gain in actual ability was measured.
Tierney and Cunningham (1980) report that an unpublished doctoral
dissertation (Gordon, 1979) compared three treatments on increasing
inferencing. Treatment one sought to build prior knowledge and awareness
of text structure; the second taught inferencing strategies directly;
the third treatment was actually a control utilizing a "language-related"
curriculum. Better transfer was reported for direct instruction, although
this was not the result hypothesized. Although their results with college
students may not generalize, Steingart and Block (1979) found imagery
to have a more facilitating effect than mere repetition for making
inferences. Second graders studied by Hansen and Pearson (1980) made more
inferences, and the learning transferred better, when they were part of a
group where inferential questions were asked regularly. This group was
superior to a "strategy" group that was trained in the process of
inferencing from their own experiences.

This paper has been an attempt to analyze two main branches of
inferential research: inference as a developmental process, and the nature
of inferences made during reading. Inference has been identified as a unique
component of comprehension (Davis, 1944, 1968; Spearritt, 1972). Tyler
(1930) established that inference was a skill that could be objectively
measured. When several different ages and ability levels of readers were
examined, differing patterns of inference emerged. Paris conducted
several experiments showing that young children are capable of producing
inferences, but that they usually do not do so spontaneously (Paris &
Upton, 1974; Paris, 1975; Paris & Lindauer, 1976).

While almost every study of inference has shown that older children
spontaneously produce more inferences (as do better readers), and that
most readers give more literal recalls than inferential ones, other
significant differences have been noted. Many of these differences deal
with type or quality of inference as a developmental variable. Hildyard
(1976) found that there was a "marked developmental difference" in the
ability to draw inferences that contradicted previous world knowledge.
Younger children produced more irrelevant inferences, while older readers
tended to have more appropriate additions in Mandler and Johnson's 1977
study. Third graders proved to make better use of added information in
drawing inferences to solve ambiguity than did kindergartners (Nezworski
et al.). This younger age group, however, proved capable of producing
inferences to solve ambiguity when presented with scrambled-order pictures,
a task that proved too difficult for four-year-old subjects (Poulsen et al.).

Examinations of the nature of inferences made by readers of connected
discourse have not been as conclusive as the developmental studies. With
the hypotheses that importance, salience, and explicitness would affect the
reader, Goetz (1977) found no main effects for importance or salience. It
did appear, from reported interactions, that inferences are more likely to
be made when they are important to the story; salience was reported to
enhance the inferencing of students with low verbal ability. Inference
appeared to be superficially processed, without stability in memory, in a
study by Spiro and Esposito (1977). Prior knowledge seems to have a
facilitating effect on inferential processing (Pearson et al.). In
a major contribution on the nature of inferencing, Tierney et al. (1978-79)
discovered that good readers made more causal and conditional connections
between propositions; poor readers more often overgeneralized. Tierney and
Bridge (1979) later reported that the key to good reading was the ability
to integrate propositions.

Here and there, in bits and pieces, inference is being established as
a measurable, definable skill. Perhaps the only point of unanimity is
that inferential ability is vital in the mature reader. Those who are
devoting their energies to producing mature readers have accepted the
challenge presented by Thorndike over half a century ago: "It is not a
small or unworthy task to learn 'what the book says'" (Thorndike, 1917,
p. 434).